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ESTROGEN RECEPTOR a REGULATED 
GENE EXPRESSION, RELATED ASSAYS AND THERAPEUTICS 



5 FIELD OF THE INVENTION 

The present disclosure relates to a plurality of genes modulated by estrogen 
or other agents, such as hormones or combinations of hormones, in various types of 
tissue. In particular, one embodiment of the disclosure relates to a plurality of genes 
which demonstrates certain patterns of expression differing qualitatively or 

10 quantitatively, with and without exposure to estrogen and/or other hormone 
compositions. The disclosure further relates to the methods of using these genes in 
identifying agents that exert at least some of the biological effects of estrogen and/or 
other agents, and to pharmaceuticals and related therapies. The disclosure further 
relates to the use of the plurality of genes in methods of monitoring, in gene chips and 

15 in kits. 

BACKGROUND OF THE INVENTION 

Estrogens exert biological effects in numerous organs throughout the body. 
The role of estrogens in reproductive biology, the prevention of postmenopausal hot 
20 flashes, and the prevention of postmenopausal osteoporosis are well established. 
Many observational studies have suggested estrogens also reduce the risk of 
development of cardiovascular disease(1), at least in part by estrogens reducing LDL 
cholesterol levels and elevating HDL cholesterol levels(2,3). More recently, estrogens 
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have been suggested to inhibit the development of colon cancer(4), inhibit the 
development of Alzheimer's disease(5), and inhibit development of cataracts (6). The 
multitude of estrogen responses matches the widespread distribution of estrogen 
receptors (ER) throughout numerous organs, with ERa expression highest in uterus, 
5 pituitary, kidney and adrenal gland and ERp expression highest in ovary, uterus, 
bladder and lung(7). While various estrogens have been profiled for biological activity, 
little is known regarding the patterns of gene expression which are responsible for 
these diverse activities. 

Thus, a need exists for the systemic analysis of the regulation by estrogen 
10 ' and/or other hormonal compositions of gene expression in various tissues and the 
identification of the plurality of differentially expressed genes. The identification of 
candidate agents that at least partially exert the same differential expression and 
development of pharmaceuticals and new treatment methods based on such agents is 
highly desirable. There also exists a need for methods of monitoring conditions and for 
15 diagnostic products, including gene chips and kits, which may be used in the 
above-described analyses. 

The embodiments provided herein relate generally to a plurality of genes, 
particularly a plurality of genes that are modulated by estrogen and/or other hormonal 
compositions in various organs, such as the uterus, kidney and pituitary gland. Such 
20 differentially expressed genes are useful in screening assays to examine the effects of 
a candidate agent on the expression of genes that are responsive to estrogen. A 
candidate agent that induces, in a given tissue, a gene expression profile that exhibits 
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one or more similarities to the gene expression profile of estrogen and/or other 
hormonal compositions, can be identified for possible use in pharmaceuticals. The 
invention also relates to the identification of estrogen responsive genes that are known 
to be associated with the inhibition of certain conditions, such as shock, 
5 postmenopausal calcium deficiencies, cardiovascular diseases, and conditions where 
there is decreased renal blood flow, such as those caused by diuretics or congestive 
heart failure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts a pattern analysis generated by a GeneChip microarray analysis. 

10 Specifically, WT or ERpKO ovariectomized mice were treated daily with vehicle or 20 
pg/kg/day E2 for six weeks. Two hours following the final dose, the mice were 
euthanized and 13 tissues removed for RNA preparation. Two independent studies 
W ere performed, with total RNA pooled from two groups of three animals for each 
condition. Gene expression was quantified by GeneChip microarrays using murine 

15 U74 sub A arrays. Data were analyzed for patterns indicating either ERct or ERp 
dependent regulation as shown. For ERa regulation, the defined search patterns 
(induction or repression) were for regulation by E2 in both wild type and ER{3KO mice, 
in both sets of mice in both studies. For ERp regulation, the defined search patterns 
were for regulation by E2 only in the wild type mice, with no change in basal 

20 expression in the ERpKO mice compared to the wild type mice. The number of genes 
in each tissue that matched the theoretical induction (t) or repression (l)patterns for 
ERa or ERp are indicated. 
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Figures 2A-2C show a series of bar graphs showing gene expression levels of known 
genes regulated in the kidney in an ERa pattern. The expression levels 
(parts-per-million) are shown for the indicated genes in WT mice treated with vehicle 
(light blue bars), WT mice treated with E2 (dark blue bars), ER0KO mice treated with 

5 vehicle (light green bars), and ER/JKO mice treated with E2 (dark green bars) using 
U74v2 subA, B, and C microarrays. Expression was measured in two independent 
sets of animals, with two groups of animals for each treatment in each study.- A gene 
name abbreviation is shown above each graph, with the corresponding Unigene 
designation shown below. The genes are graphed in approximate order of regulation 

1 0 from largest induction (CYP7B1 ) to largest repression (BHMT). 

Figure 3 shows histological sections of the kidney in in situ hybridization studies using 
antisense probes for CYP7B1, TF, STAT5A or GADD45G in ovariectomized mice 
treated with vehicle or 20//g/kg/day E2 for six weeks. No signal was detected with the 
corresponding sense probes. 

15 Figure 4 shows histological sections of the kidney in in situ hybridization using 
antisense probes for STAT5A or GAD45G in ovariectomized rats treated with vehicle 
or 20 //g/kg/day E2 for six weeks. No signal was detected with the corresponding 
sense probes. 

Figure 5A-5B show a series of graphs showing expression levels for various genes. 
20 Ovariectomized WT mice were treated with vehicle or various doses of E2 for six 
weeks. (A) Kidney gene expression values (mean ± SEM) were determined by 
real-time PCR for each individual animal and normalized for GAPDH expression. The 
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mean expression level In vehicle-treated mice was defined as 1 for each gene. (B) 
Uterine wet weights (mg) and gene expression values (mean ± SEM). 

Figures 6A-6D show a series of bar graphs depicting relative expression levels for 
various genes. Ovariectomized WT mice were treated with vehicle, 20//g/kg/day E2, 5 
5 mg/kg/day W-0292, W-0070 or propylpyrazole triol (PPT) for six. weeks. Kidney gene 
expression values were determined by real-time PCR for each individual animal and 
normalized for GAPDH expression. The mean expression level in vehicle-treated mice 
was defined as 1 for each gene. *p < 0.01 for comparison to vehicle treated animals. 

Figures 7A-7D show bar graphs depicting expression levels of intact and AAF1-ERa 
10 mRNA determined in uterus and kidney by using a real-time PCR assay specific for 
exon 3 of the mouse ERo or ER0 genes. Each graph utilizes a different scale. 
Expression levels were normalized for total RNA level to avoid GAPDH expression 
differences between kidney and uterus. 

Figures 8A-8C show a series of relative expression levels for various genes in different 
15 types of mice. This figure also presents a model for AF1 or AF2 activation for each 
gene. Ovariectomized WT mice, ERaER/3KO mice (expressing only AAF1-ERcr) or 
ERoKO mice (expressing AAF1-ERer along with ERjff) were treated for 6 weeks with 
vehicle, 10//g/kg/day E2, 10 A/g/kg/day E2 + 5 mg/kg/day ICI182780. or 5 mg/kg/day 
tamoxifen. Kidney gene expression values were determined by real-time PCR for each 
20 individual animal and normalized for GAPDH expression. The mean expression level 
in vehicle-treated WT mice was defined as 1 for each gene. *p < 0.01 for comparison 
to vehicle treated animals. A model for the requirement of AF1 or AF2 for activation of 
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each gene is shown below each graph. The change in ER shape with tamoxifen (T) 
bound denotes the alternate helix 12 conformation induced by tamoxifen compared to 
E2. CA denotes coactivators. 

SUMMARY OF EMBODIMENTS 

5 One embodiment of the disclosure relates to a plurality of genes, each of 

whom is differentially expressed in tissue cells exposed to estrogen and/or other 
hormones or combination of hormones and tissue cells without said exposure, which 
plurality comprises a first group and a second group, wherein each gene in said first 
group is differentially expressed at a higher level in said tissue cells exposed to 

10 estrogen and/or a hormone or combinations of hormones than in said tissue cells 
without said exposure, wherein each gene in said second group is differentially 
expressed at a lower level in said tissue cells exposed to estrogen and/or a hormone 
or combinations of hormones than in said tissue cells without said exposure. 
Confirmation of such expression is confirmed by real-time PCR. Such cells preferably 

15 are from the kidney, pituitary or uterus. Exposure to estrogen and/or the other 
hormones is in vivo or in vitro. The higher level and lower levels are assessed using a 
predetermined statistical significance standard based on measurements of expression 
levels. The measurements can obtained using nucleotide arrays or nucleotide filters. 

Another embodiment relates to a method for identifying an agent having the 
20 biological effect of estrogen and/or other hormones or combination of hormones, on 
gene expression in a given tissue, wherein said desired effect represents a first 
plurality of genes differentially expressed at various levels, which method comprises: 
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exposing, in vivo or in vitro, tissue cells to said agent; 

measuring expression levels of a multiplicity of genes in said tissue cells exposed to 
said agent and tissue cells without said exposure, said multiplicity being greater than 
said first plurality; 

5 determining, using a predetermined statistical significance standard, genes which are 
differentially expressed in said tissue cells exposed to said agent and said tissue cells 
without said exposure, said genes constitute a second plurality; and 

comparing the expression levels of genes in said second plurality with the expression 
levels of genes in said first plurality, 

10 wherein said agent is identified as having said desired effort if said first and second 
pluralities are the same and said expression levels in said first and second pluralities 
are substantially the same. The tissue preferably is kidney, uterus or pituitary tissue. 
Expression levels are confirmed by real-time PGR. 

Another embodiment is directed to an agent identified by the above method. 

15 Another embodiment is a pharmaceutical composition comprising this agent 

and a pharmaceutical^ acceptable excipient. 

Another embodiment relates to a method for identifying an agent capable of 
maintaining vascular volume in septic shock, which method comprises: 

exposing, in vivo or in vitro, kidney cells to the agent; 
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measuring expression levels of NTT73 and ABCC3 in said kidney cells exposed to the 
agent and kidney cells without the exposure; 

comparing the expression levels of NTT73and ABCC3 with the expression levels of 
genes in the plurality if genes described above with regard to the kidney, wherein the 
5 induced genes are NTT73 and ABCC3, 

wherein said agent is identified as capable of maintaining vascular volume in septic 
shock if said expression levels of NTT73 and ABCC3 are substantially the same as 
said expression levels of genes in such plurality. 

Another embodiment relates to a method of identifying an agent capable of 
10 enhancing calcium uptake in post-menopausal women, which method comprises: 

exposing, in vivo or in vitro, kidney cells to said agent; 

measuring expression levels of CYP7B1 in said kidney cells exposed to said agent 
and kidney cells without said exposure; 

comparing the expression levels of CYP7B1 with the expression levels of genes in the 
15 plurality of genes in the kidney is induced CYP7B1 , 

wherein said agent is identified as capable of enhancing calcium uptake in 
post-menopausal women if said expression levels of CYP7B1 are substantially the 
same as said expression levels of genes in such plurality. 

Another embodiment relates to a method for identifying an agent for treating 
20 cardiovascular disorders, which method comprises: 



WO 03/087336 



PCT7US03/11240 



exposing, in vivo or in vitro, kidney cells to said agent; 

measuring expression levels of BHMT and SAHH in said kidney cells exposed to said 
agent and kidney cells without said exposure; 

comparing the expression levels of BHMT and SAHH with the expression levels of 
5 genes in the plurality of genes, wherein in the kidney BHMT and SAHH are repressed, 

wherein said agent is identified for treating cardiovascular disorders if said expression 
levels of BHMT and SAHH are substantially the same as said expression levels of 
genes in the such plurality. 

Another embodiment relates to agents identified by any of the above methods 
10 and pharmaceutical agents comprising such agents and a pharmaceutically 
acceptable excipient. 

Another embodiment relates to a solid substrate comprising any of the above 
described plurality of genes. 

Another embodiment relates to a kit comprising any of the above plurality of 

15 genes. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The embodiments herein provide a plurality of genes modulated by estrogen 
and/or other hormonal compositions of interest in various types of tissue and the use of 
such a plurality of differentially expressed genes in screening for agents that exert at 
20 least some of the biological effects of estrogen and other hormonal compositions of 
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interest. Such identified agents can be used in pharmaceuticals and in related new 
therapeutic methods. The plurality of genes can be used in methods of monitoring. 

Definitions: 

In general, " a gene " is a region on the genome that is capable of being 
5 transcribed to an RNA that either has a regulatory function, a catalytic function and/or 
encodes a protein. A gene typically has introns and exons, which may organize to 
produce different RNA splice variants that encode alternative versions of a mature 
protein. "Gene" contemplates fragments of genes that may or may not represent a 
functional domain. 

10 A " plurality of genes " as used herein refers to a group of identified or isolated 

genes whose levels of expression vary in different tissues, cells or under different 
conditions or biological states. The different conditions may be caused by exposure to 
certain agent(s) - whether exogenous or endogenous - which include hormones, 
receptor ligands, chemical compounds, etc. The expression of a plurality of genes 

15 demonstrates certain patterns. That is, each gene in the plurality is expressed 
differently in different conditions or with or without exposure to a certain endogenous or 
exogenous agents. The extent or level of differential expression of each gene may 
vary in the plurality and may be determined qualitatively and/or quantitatively 
according to this invention. A gene expression profile, as used herein, refers to a 

20 plurality of genes that are differentially expressed at different levels, which constitutes 
a "pattern" or a "profile." As used herein, the term "expression profile," "profile," 
"expression pattern," "pattern," "gene expression profile," and "gene expression 
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pattern" are used interchangeably. 

An " agent that exerts at least some of the biological effects of estrogen ." 
as used herein refers to any factor, agent, compound whether endogenous or 
exogenous in origin, which is capable of binding and interacting with estrogen 
5 receptors and thereby eliciting certain biological effects of extrogen. The skilled 
artisan would know that, for instance, one of the biological effects of estrogen is to 
promote the development of the female reproductive system. Other biological effects 
of estrogen are well documented and discussed, infra. 

Gene expression profiles may be measured, according to this invention, by 
1 0 using nucleotide or microarrays. These arrays allow tens of thousands of genes to be 
surveyed at the same time. 

" Hormones or combinations of hormones " include for instance, 
combinations of estrogens or other hormones that are known to exert biological effects 
of estrogen. 

15 As used herein, the term " microarrav" refers to nucleotide arrays that can be 

used to detect biomolecules, for instance to measure gene expression. "Array," "slide," 
and "chip" are used interchangeably in this disclosure. Various kinds of arrays are 
made in research and manufacturing facilities worldwide, some of which are available 
commercially There are, for example, two main kinds of nucleotide arrays that differ in 

20 the manner in which the nucleic acid materials are placed onto the array substrate: 
spotted arrays and in situ synthesized arrays. One of the most widely used 
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oligonucleotide arrays is GeneChip™ made by Affymetrix, Inc. The oligonucleotide 
probes that are 20- or 25-base long are synthesized in silico on the array substrate. 
These arrays tend to achieve high densities (e.g., more than 40,000 genes per cm 2 ). 
The spotted arrays, on the other hand, tend to have lower densities, but the probes, 
5 typically partial cDNA molecules, usually are much longer than 20- or 25-mers. A 
representative type of spotted cDNA array is LifeArray made by Incyte Genomics. 
Pre-synthesized and amplified cDNA sequences are attached to the substrate of these 
kinds of arrays. 

In one embodiment, the nucleotide is an array (i.e., a matrix) in which each 
10 position represents a discrete binding site for a product encoded by a gene (e.g., a 
protein or RNA), and in which binding sites are present for products of most or almost 
all of the genes in the organism's genome. In one embodiment, the "binding site" 
(hereinafter, "site") is a nucleic acid or nucleic acid analogue to which a particular 
cognate cDNA can specifically hybridize. The nucleic acid or analogue of the binding 
15 site can be, e.g., a synthetic oligomer, a full-length cDNA, a less-than full length cDNA, 
or a gene fragment. 

Although the microarray may contain binding sites for products of all or almost 
all genes in the target organism's genome, such comprehensiveness is not necessarily 
required. Usually the microarray will have binding sites corresponding to at least about 
20 50% of the genes in the genome, often at least about 75%, more often at least about 
85%, even more often more than about 90%, and most often at least about 99%. 
Preferably, the microarray has binding sites for genes relevant to the action of the gene 
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expression modulating agent of interest or in a biological pathway of interest. 

The nucleic acid or analogue are attached to a " solid support ," which may be 
made from glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, or 
other materials. A preferred method for attaching the nucleic acids to a surface is by 
5 printing on glass plates, as is described generally by Schena et al., 1995, Quantitative 
monitoring of gene expression patterns with a complementary DNA microarray, 
Science 270:467-470. This method is especially useful for preparing microarrays of 
cDNA. See also DeRisi et al., 1996, Use of a cDNA microarray to analyze gene 
expression patterns in human cancer, Nature Genetics 14:457-460; Shalon et al„ 
10 1996, A DNA microarray system for analyzing complex DNA samples using two-color 
fluorescent probe hybridization, Genome Res. 6:639-645; and Schena et al., 1995, 
Parallel human genome analysis; microarray-based expression of 1000 genes, Proc. 
Natl. Acad. Sci. USA 93: 1 0539-1 1 286. 

In a preferred embodiment, the microarray is a high-density oligonucleotide 
15 array, as described above. In a particularly preferred embodiment, the nucleotide 
arrays are the MG JJ74 and MG_U74v2 arrays from Affymetrix. 

" Polymerase Chain Reaction" or " PCR" is an amplification-based assay 
used to measure the copy number of the gene. In such assays, the corresponding 
nucleic acid sequences act as a template in an amplification reaction. In a quantitative 
20 amplification, the amount of amplification product will be proportional to the amount of 
template in the original sample. Comparison to appropriate controls provides a 
measure of the copy number of the gene, corresponding to the specific probe used, 
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according to the principle discussed above. Methods of " real-time quantitative PCR " 
using Taqman probes are well known in the art. Detailed protocols for real-time 
quantitative PCR are provided, for example, for RNA in: Gibson et aL, 1996, A novel 
method for real time quantitative RT-PCR. Genome Res. 1 0:995-1 001 ; and forDNA in: 
5 Heid et aL, 1996, Real time quantitative PCR. Genome Res. 10:986-994. 

A TaqMan-based assay can also be used to quantify polynucleotides. TaqMan 
based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3* quenching agent. The probe hybridizes to a PCR product, but cannot 
itself be extended due to a blocking agent at the 3' end. When the PCR product is 
10 amplified in subsequent cycles, the 5' nuclease activity of the polymerase, for example, 
AmpliTaq, results in the cleavage of the TaqMan probe. This cleavage separates the 5' 
fluorescent dye and the 3' quenching agent, thereby resulting in an increase in 
fluorescence as a function of amplification (see, for example, 
http://www2.perkin-elmer.com). 

15 Other suitable amplification methods include, but are not limited to, ligase 

chain reaction (LCR) (see, Wu and Wallace, 1989, Genomics 4: 560; Landegren et aL, 
1988 Science 241: 1077; and Barringer et aL, 1990, Gene 89: 117), transcription 
amplification (Kwoh et aL, 1989, Proc. Natl. Acad. ScL USA 86: 1173), self-sustained 
sequence replication (Guatelli et aL, 1990, Proc. Nat Acad. ScL USA 87: 1874), dot 

20 PCR, and linker adapter PCR, etc. 

The " level of mRNA" in a biological sample refers to the amount of mRNA 
transcribed from a given gene that is present in a cell or a biological sample. One 
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aspect of the biological state of a biological sample (e.g. a cell or cell culture) usefully 
measured in the present invention is its transcriptional state. The transcriptional state 
of a biological sample includes the identities and abundances of the constituent RNA 
species, especially mRNAs, in the cell under a given set of conditions. Preferably, a 
5 substantial fraction of all constituent RNA species in the biological sample are 
measured, but at least a sufficient fraction is measured to characterize the action of an 
agent or gene modulator of interest. The level of mRNA may be quantified by methods 
described herein or may be simply detected, by visual detection by a human, with or 
without comparison to a level from a control sample or a level expected of a control 
10 sample. 

A " biological sample, " as used herein refers to any sample taken from a 
biological subject, in vivo or in situ. A biological sample may be a sample of biological 
tissue, or cells or a biological fluid. Biological samples may be taken, according to this 
invention, from any kind of biological species, any types of tissues, and any types of 
15 cells, among other things. Cell samples may be isolated cells, primary cell cultures, or 
cultured cell lines according to this invention. Biological samples may be combined or 
pooled as needed in various embodimets. Preferred tissues are from the uterus, 
kidney, pituitary glands, breast, brain and adipose tissue. 

" Modulation of gene expression, " as this term is used herein, refers to the 
20 induction or inhibition of expression of a gene. Such modulation may be assessed or 
measured by assays. Typically, modulation of gene expression may be caused by 
endogenous or exogenous factors or agents. The effect of a given compound can be 
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measured by any means known to those skilled in the art. For example, expression 
levels may be measured by PCR, Northern blotting, Primer Extension, Differential 
Display techniques, etc. 

" Induction of expression " as used herein refers to any observable or 
5 measurable increase in the levels of expression of a particular gene, either 
qualitatively or quantitatively. The measurement of levels of expression may be 
carried out according to this invention using any techniques that are capable of 
measuring RNA transcripts in a biological sample. Examples of these techniques 
include, as discussed above, PCR, TaqMan, Primer Extension, Differential display and 
10 nucleotide arrays, among other things. 

" Repression of expression. " "Repression" or "inhibition" of expression, are 
used interchangeably according to this disclosure. It refers to any observable or 
measurable decrease in the levels of expression of a particular gene, either 
qualitatively or quantitatively. The measurement of levels of expression may be 
15 carried out using any techniques that are capable of measuring RNA transcripts in a 
biological sample. The examples of these techniques include, as discussed above, 
PCR, TaqMan, Primer Extension, Differential Display, and nucleotide arrays, among 
other things." 

A " gene chip" or "DNA chip" is described, for instance, in U.S. Patent Nos. 
20 5,412,087, 5,445,934 and 5,744,305 and is useful for screening gene expression at 
the mRNA level. Gene chips are commercially available. 
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A "kit" is one or more of containers or packages, containing at least one 
"plurality of genes," as described above, on a solid support. Such kit also may contain 
various reagents or solutions, as well as instructions for use and labels. 

A " detectable label" or a " detectable moiety" is a composition that when 
5 linked with a nucleic acid or a protein molecule of interest renders the latter detectable, 
via spectroscopic, photochemical, biochemical, immunochemical, or chemical means. 
For example, useful labels include radioactive isotopes, magnetic beads, metallic 
beads, colloidal particles, fluorescent dyes, electron-dense reagents, enzymes, biotin, 
digoxigenenin or haptens. A "labeled nucleic acid or oligonucleotide probe" is one that 
10 is bound, either covalently, through a linker or a chemical bond, or noncovalently 
through ionic, vander Waals, electrostatic, hydrophobic interactions, or hydrogen 
bonds, to a label such that the presence of the nucleic acid or probe may be detected 
by detecting the presence of the label bound to the nucleic acid or probe. 

A " nucleic acid probe " is a nucleic acid capable of binding to a target nucleic 
1 5 acid or complementary sequence through one or more types of chemical bond, usually 
through complementary base pairing usually through hydrogen bond formation. As 
used herein, a probe may include natural (i.e., A, G, C, or T or modified bases 
(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a 
linkage other than a phosphodiester bond, so long as it does not interfere with 
20 hybridization. It will be understood by one of skill in the art that probes may bind target 
sequences that lack complete complementarity with the probe sequence depending 
upon the stringency of the hybridization conditions. The probes are preferably directly 
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labeled with isotopes, for example, chromophores, luminphores, chromogens, or 
indirectly labeled with biotin to which a strepavidin complex may later bind. By 
assaying the presence or absence of the probe, one can detect the presence or 
absence of a target gene of interest. 

5 " In situ hybridization " is a methodology for determining the presence of or 

the copy number of a gene in a sample, for example, fluorescence in situ hybridization 
(FISH) (see Angerer, 1987 Meth. Enzymol 152: 649). Generally, in situ hybridization 
comprises the following major steps: (1) fixation of tissue or biological structure to be 
analyzed; (2) prehybridization treatment of the biological structure to increase 

1 0 accessibility of target nucleic acid, and to reduce nonspecific binding; (3) hybridization 
of the mixture of nucleic acids to the nucleic acid in the biological structure or tissue; 
(4) post-hybridization washes to remove nucleic acid fragments not bound in the 
hybridization, and (5) detection of the hybridized nucleic acid fragments. The probes 
used in such applications are typically labeled, for example, with radioisotopes or 

15 fluorescent reporters. Preferred probes are sufficiently long, for example, from about 
50, 100, or 200 nucleotides to about 1000 or more nucleotides, to enable specific 
hybridization with the target nucleic acid(s) under stringent conditions. 

Hybridization protocols suitable for use with the methods of the invention are 
described, for example, in Albertson (1984) EMBOJ. 3:1227-1234; Pinkel (1988) Proc. 
20 Natl. Acad. Sci. USA 85:9138-9142; EPO Pub. No. 430:402; Methods in Molecular 
Biology, Vol. 33: In Situ Hybridization Protocols, Choo, ed., Humana Press, Totowa, NJ 
(1994); etc. 
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" A predetermined statistical significance standard based on 
measurements of expression levels" is a confidence score based upon the 
assessment of four factors. Specifically, a score is assigned to each gene that reflects 
the confidence of the change. The score is based on four criteria: Fold Change, 
5 p-value (T-test), Present Calls, Frequency Value. For example, see the Table below: 

TABLE I - Confidence criteria 

Fold Change Score 

>2.0 5 

>1.5 0 
10 <1.5 -3 

pValue Score 

<0.05 3 

0.05 to 0.1 2 

0.1 to 0.2 0 
15 0.2 to 0.3 -1 

0.3 to 0.5 -3 

Present Calls Score 

2-4 3 

11 
20 0 0 
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Second Largest Frequency 
>20 3 
15 to 191 
10-14 -1 
5 <10 -3 

Outliers (Max Frea/2 nd largest) 
>2.5 -3 

" Data " refers to information obtained that relates to the expression of genes in 
response to exposure to estrogen or an agent of unknown biological effect. The 

10 plurality of genes identified by the disclosed methods are examples of such 
information. The information is stored in electronic or paper formats. Electronic format 
can be selected from the group consisting of electronic mail, disk, compact disk (CD) 
digital versatile disk (DVD), memory card, memory chip, ROM or RAM, magnetic 
optical disk, tape, video, video clip, microfilm, internet, shared network, shared server 

15 and the like; wherein data is displayed, transmitted or analyzed via electronic 
transmission, video display, telecommunication, or by using any of the above stored 
formats; wherein data is compared and compiled at the site of sampling specimens or 
at a location where the data is transported following a process as described above. 

"Genes modulated by estrogen." Genes regulated by estrogen and/or 
20 hormonal compositions and identified according to the disclosed methods, are listed in 
Tables II, III and IV. Relevant Unigene codes or Genbank accession numbers are 
provided. 
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Identification of Genes Modulated by Estrogen 
A. Biological Sample and Assay 

One embodiment disclosed herein relates to a plurality of genes, each of 
which is differentially expressed in kidney cells exposed to estrogen or a candidate 
5 agent and kidney cells without exposure to estrogen or a candidate agent, which 
plurality comprises a first group and a second group, wherein each gene in said first 
group is differentially expressed at a higher level in said kidney cells exposed to 
estrogen or a candidate agent than in said kidney cells without said exposure, wherein 
each gene in the second group is differentially expressed at a lower level in said kidney 
1 0 cells exposed to estrogen or candidate agent than in said kidney cells without said 
exposure. 

A biological sample of kidney cells are obtained according to methods well 
known to the skilled artisan. One group of kidney cells are exposed to estrogen. Such 
estrogen may be 17IJ estradiol. The kidney cells may be from one or more animals of 

1 5 the same species or from a culture of kidney cells or kidney tissue. Preferably, such 
cells are from a mammal, most preferably a mouse, rat or human. Such animal must 
produce little or no estrogen. For instance, an aromatase knockout animal cannot 
produce estrogen. Because the major source of circulating estrogen is the ovary, 
ovariectomy dramatically decreases circulating estrogen levels. Thus, in one 

20 embodiment, ovariectomized animals are used. By "exposure" is meant a type and 
quantity of either in vivo or in vitro administration that is applicable to the source of the 
kidney cells and known and acceptable to those of skill of the art. The total RNA from 
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such cells is prepared by methods known to the skilled artisan, e.g., by Trizol 
(Invitrogen) followed by subsequent repurification, e.g. via Rneasy columns (Qiagen). 
The total RNA is used to generate a labeled target according to methods and using 
detectable labels well know in the art, as described above in detail. For instance, the 
5 RNA may be labeled with biotin to form a cRNA target for use in an assay. See a 
complete description of preferred methods in theAffymetrix GeneChip® technical 
manual (Pages 700217 through 700223), which is herein incorporated by reference. 

The assay, according to the invention, may be any assay suitable to detect 
gene expression. For instance, mRNA, cDNA or protein expression may be detected. 

1 0 Many different types of assays are known, examples of which are set forth above, 
including analyses by nucleotide arrays and nucleotide filters. The hybridization 
conditions (temperature, time, and concentrations) are adjusted according to 
procedures also well known in the art, as described above. In a preferred embodiment, 
the assay of the invention involves the use of a high density oligonucleotide array. For 

1 5 instance, in a preferred embodiment, cRNA labeled with biotin is hybridized to a murine 
MG_U74Av2 probe array (Affymetrix, Santa Clara, Ca.) for 16 hours at 45 degrees. 
Eleven biotin-labeled cRNAs at defined concentration are spiked into each 
hybridization and used to convert average difference values to frequencies expressed 
as parts per million. 

20 Other solid supports and microarrays are known and commercially available 

to the skilled artisan, as described above. 
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B. Measurements and Statistical Analysis 

The assay of the invention is used to identify genes modulated by estrogen. 
Such modulation may be induction of expression (a plurality of genes belonging to a 
"first group") or repression of such expression (a plurality of genes belonging to a 
5 "second group"). Gene expression induction is indicated by a higher level of 
expression, whereas repression is indicated by a lower level of expression, as 
assessed using a predetermined statistical significance standard based on 
measurements of expression levels. 

Thus, the genes expressed or repressed in kidney cells with estrogen 
1 0 exposure are compared to the genes expressed or repressed in kidney cells that were 
not exposed to estrogen. Pairwise comparisons are made between each of the 
treatments. A pairwise comparison is the expression data for a given gene under a 
given treatment condition compared to the expression data for this gene under a 
second treatment condition. The fold change ratio is then calculated, the p-value 
15 based on Student's t-test, the number of present calls, and the expression level for 
each comparison. A confidence score "CS" is defined as CS(x)= FC(x) + PV(x) + 
PC(x) + EL(x) where FC, PV, PC and EL are scores assigned to the fold change, 
p-value, number of present calls, and the expression level, respectively. FC(x) is 
assigned 5 if the fold change ratio was greater then 1 .95 and is assigned 0 if the ratio is 
20 between 1.95 and 1.5. PV(x) is assigned 3 if the p-value is less then 0.05 and is 
assigned 2 if the p-value was between 0.05 and 0.1. PC(x) is assigned 3 if at least 
50% of the samples are called P by the Affymetrix algorithm and assigned 1 if only 
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25% of the samples are called R EL(x) is assigned 3 if at least two samples have 
frequency value of 20 or greater and assigned 1 if two samples only have a frequency 
greater then 15. Penalty points are assigned if the fold change is less then 1.5, the 
p-value is greater then 0.2 or the frequency values were below 15 ppm. CS(x) ranged 
5 for -14 to 14 with qualifiers having a score of 14 considered the most significant 
changes. Genes with 11 or more points in any one pairwise comparison is considered 
to be significant. Real-time PCR and histology analyses are then performed to confirm 
the identity of the genes, essentially as described previously (9,10), which are herein 
incorporated by reference. The above described analysis can be used to identify 
10 candidate agents that are "estrogen-like" in that they have a differential expression 
profile which is in the most preferred embodiment substantially the same as estrogen's. 
For instance, in one embodiment, the expression levels for the genes upon exposure 
to the respective compounds is at least within 50% of each other. 

C. Biological Samples from other Organs 

15 The above described methods, assay and analysis can be applied to 

biological samples from any tissue, including the uterus, pituitary gland, liver, brain, 
colon, breast, adipose tissue, etc. In preferred embodiments, the biological samples 
are from the kidney, pituitary gland and the uterus. 

D. The Plurality of Genes 

20 Pursuant to the above described methods, the genes listed in Table II were 

identified as being differentially expressed upon exposure to estrogen. Genes in which 
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expression is induced by estrogen are considered to be genes of the "first group," 
whereas genes that are repressed by estrogen are considered to be in the "second 
group". 

Specifically, the estrogen modulated genes in the kidney are Tissue Factor, 
5 CYP7B1, BCAT1, STAT5A, GADD45G, BHMT, SAHH, NTT73, ABCC3. Of these 
genes, estrogen induced expression in all but BHMT and SAHH, where it repressed 
expression. 

Thus, one disclosed embodiment is a plurality of genes, wherein in the first 
group, where gene expression in kidney cells is induced by estrogen exposure, the 
1 0 plurality of genes comprise NTT73 and ABCC3. Another disclosed embodiment is a 
plurality of genes wherein the "first group" comprises CYP7B1 in kidney cells. In 
another embodiment, the plurality of genes of the "second group," where gene 
expression in kidney cells is repressed by estrogen exposure, comprises at least 
BHMT and SAHH. 

15 Another disclosed embodiment is directed to a plurality of genes in kidney 

cells, wherein the first group comprises Tissue Factor, CYP7B1 , BCAT1 , STAT5A, and 
GADD45G, and wherein said second group comprises BHMT 

Another disclosed embodiment is directed to a plurality of the genes wherein 
the first group comprises CYP7B1, TF, SCYA28, Iga, Vk28, PHD 2, ELF 3, TIM1 , 
20 STAT5A, COR1, BCAT1, ABCC3, TIM2, NAT6, RGS3, GNBP3, BCL7A, 17pDHH, 
FYVE ZFP, NTT73, AGPS, TR1M2, HBACH, CIS2, CYP27B1, and STAT5B, wherein 
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said second group comprises SAHH, ADH1A7, RARRES2, and BHMT. Another 
disclosed embodiment relates to a plurality of genes, wherein the first group comprises 
COR1 and GNBP3. 

The estrogen modulated genes in the pituitary gland are STAT5B, GADD45G, 
5 Kallikrein-9, and FSHb, the expression of which is induced by estrogen for all but 
FSHb, which is repressed. 

Thus one embodiment relates to a plurality of genes in the pituitary gland, 
wherein the first group comprises STAT5B and GADD45G. 

Another embodiment relates to a plurality of genes, wherein the first group 
10 comprises STAT5B, GADD45G1 and Kallikreins genes in the pituitary. 

Yet another embodiment relates to a plurality of genes, wherein the second 
group of genes in the pituitary gland comprise FSHb. 

Pursuant to the above methods, the inventors discovered that the estrogen 
modulated genes in the uterus comprise SFRP4, Deiodinase ( type II), Procollagen 
15 ( type I, Alpha I), vimentin and IGFBP4, Scavenger receptor, AI121305, ALOX15, 

BCAT1, SiAMOX, C3, FOS, MAP2k1, CEBPb, EGR1 and CYP1A1. All of these genes 
are induced by estrogen in the uterus except for Scavenger receptor and CYP1A1, 
which are repressed. 

Thus, one embodiment is directed to a plurality of genes in the uterus, 
20 wherein the first group comprises SFRP4, Deiodinase (type II), Procollagen ( type I, 
Alpha I), vimentin and IGFBP4. 
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Another embodiment of the invention is directed to the plurality of genes, 
wherein the first group in the uterus comprises AM 21 305, ALOX15, BCAT1, SiAMOX, 
C3, FOX, MAP2k1, CEBPb and EGR1. 

Another embodiment is directed to a plurality of genes wherein the first group 
5 in the uterus comprises SFRP4, Deiodinase ( type II), Procollagen ( type I, Alpha I), 
vimentin and IGFBP4, Scavenger receptor, AM 21 305, ALOX15, BCAT1, SiAMOX, C3, 
FOS, MAP2k1 , CEBPb and EGR1 . 

In another embodiment, the plurality of genes in the second group in the 
uterus comprises CYP1 A1 . 

10 In yet another embodiment, the plurality of genes in the second group in the 

uterus comprises Scavenger receptor. 

Methods of identifying agents 

Based upon the above described methods for determining differential 
expression of genes in various organs, another aspect of the invention relates to the 

15 identification of candidate agents that have the same or substantially the same 

biological effect of a known agent, such as estrogen or another hormonal combination 
of known biological effect. An "agent" could be any compound of unknown biological 
effect on genes in a given body site. Specifically, the invention relates to a method for 
identifying an agent having a desired effect on gene expression in an organ, wherein 

20 said desired effect represents a first plurality of genes differentially expressed at 
various levels, which method comprises: 
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exposing, in vivo or in vitro, organ cells to the agent; 

measuring expression levels of a multiplicity of genes in the organ cells 
exposed to the agent and organ cells without the exposure, the multiplicity being 
greater than said first plurality; 

5 determining, using a predetermined statistical significance standard, genes 

which are differentially expressed in the organ cells exposed to the agent and the 
organ cells without the exposure, the genes constitute a second plurality; and 

comparing the expression levels of genes in the second plurality with the 
expression levels of genes in said first plurality, 

1 0 wherein the agent is identified as having said desired effort if said first and second 
pluralities are the same and said expression levels in said first and second pluralities 
are substantially the same. The "organ cells" may be from any type of biological 
sample, as described above. In a preferred embodiment, such cells are from the 
kidney, pituitary gland or uterus. The "first plurality of genes" and "second plurality" of 

15 genes can be identified through a nucleotide array or filter, as described above. The 
comparing is performed using a suitable statistical technique with the assistance of 
known and commercially available programs, also as described above. 

Another embodiment relates to an agent identified by the above method. 

Yet another embodiment relates to a gene chip comprising any one or more of 
20 the above plurality of genes. 
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Pharmaceuticals and Methods of Treating 

The identification of agents that induce or repress the expression of a gene 
associated with a given disorder or condition can lead to the development of 
pharmaceuticals that can be administered to a patient at therapeutically effective 
5 doses to prevent, treat, or control such disorder or condition. 

Some conditions associated with estrogen regulation of gene expression in 
the kidney are known. For instance, in women, high estrogen levels preceding 
ovulation, during pregnancy, and resulting from estrogen administration commonly 
results in body water retention (23,24). Increased renal sodium reabsorption is a major 
1 0 mechanistic component for the elevated fluid retention (25). In rats, estrogen has been 
shown to increase thiazide-sensitive NaCI cotransporter expression levels(26), 
providing one possible molecular basis for estrogen effects on sodium retention. 

Pursuant to the methods of the invention as disclosed above and as 
exemplified in greater detail in the Examples below, two additional estrogen regulated 

15 genes that influence sodium retention were identified. First, estrogen (E2) treatment 
increased mRNA levels for NTT73 (27), which is a sodium and chloride dependent 
transporter, known to regulate sodium retention. Second, E2 treatment also induced 
mRNA levels for ABCC3, a member of a family of genes which are known to modulate 
epithelial sodium channel activity (28). The physiological role of E2 regulation of these 

20 genes may lie in the large volume expansion required during pregnancy. The ED50 
value for E2 activation of gene expression in the kidney was about 10-fold higher than 
that required for uterine weight increases (Figure 5), perhaps a mechanism to ensure 
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that normally estrogen actions only occur in the kidney when very high levels of 
estrogen are present, as during pregnancy. 

Premenopausal women survive septic shock better than comparably aged 
males while postmenopausal women have a diminished survival advantage. Since 
5 volume loss is a major cause of morbidity in shock, it is expected that enhanced 
sodium and water retention due to elevated expression of NTT73(27) by E2 plays a 
role in this protective process. Thus, one disclosed embodiment relates to a method 
for identifying an agent capable of maintaining vascular volume in septic shock 
comprising exposing, in vivo or in vitro, kidney cells to an agent, measuring expression 
10 levels of NTT73 and ABCC3 in kidney cells exposed the agent and in kidney cells not 
exposed to the agent; comparing the expression levels of the NTT73 and ABCC3 with 
the expression levels of the genes kidney cells exposed to estrogen. The candidate 
agent identified by this process can be used in pharmaceuticals for purposes of 
maintaining vascular volume in the treatment of septic shock. 

15 Another estrogen modulated gene in the kidney with biological significance is 

CYP27B1, the enzyme responsible for the rate limiting conversion of inactive 
25-hydroxy vitamin D3 into active 1 ,25-dihidroxy vitamin D (29). This process is known 
to occur in the proximal tubules of the kidney and has been shown to be stimulated by 
estrogen treatment of birds(30). Urinary calcium excretion is increased in 

20 postmenopausal women, while estrogen treatment reduces urine calcium levels (31, 
32). The presence of vitamin D receptors within the proximal convoluted tubule and 
collecting duct tubules of the kidney suggests that E2 induction of CYP27B1 is the 
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basis of this beneficial effect. 

Thus, the invention relates to a method of identifying agents that are capable 
of enhancing calcium uptake in postmenopausal women comprising exposing, in vivo 
or in vitro, kidney cells to an agent, measuring expression levels of CYP7B1 in kidney 
5 cells exposed the agent and in kidney cells not exposed to the agent; comparing the 
expression levels of the CYP7B1 with the expression levels of the genes in kidney 
cells exposed to estrogen. The agent identified by this process can be used in 
pharmaceuticals for purposes of enhancing calcium uptake in postmenopausal 
women. 

10 It is known that estrogen treatment reduces expression of 

betaine:homocysteine methyltransferase (BHMT) and S-adenosy (homocysteine 
hydrolase (SAHH), two enzymes involved in the methionine / homocysteine cycle (34). 
Elevated plasma homocysteine levels are now recognized as an important risk factor 
for the development of cardiovascular disease (35), and estrogen treatments reduced 

15 plasma homocysteine levels in postmenopausal women. Thus, the regulation of 
BHMT and SAHH provides a mechanistic link for this effect. 

Thus, one embodiment disclosed herein relates to a method of identifying 
candidate agents for treating cardiovascular disorders comprising measuring 
expression of BHMT and SAHH in kidney cells exposed to an agent and in kidney cells 
20 with such exposure, comparing the expression levels of BHMT and SAHH with the 
expression levels of the genes in kidney cells exposed to estrogen. The agent 
identified by this process can be used in pharmaceuticals for purposes of treating 
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cardiovascular disorders. 

Finally, E2 treatment induced expression of COR1 (chemokine orphan 
receptor 1, RDC1) an orphan G-protein coupled receptor (37), along with the 
guanylate nucleotide binding protein 3 (GNBP3) and the regulator of G-protein 
5 signaling 3 (RGS3), suggesting these proteins may form a functional unit. RDC1 is a 
receptor for the potent vasodilatory peptide adrenomedullin and calcitonin 
gene-related peptide, CGRP (38). Administration of CGRP to ovariectomized rats 
does not produce a decrease in kidney vascular resistance; however, in 
ovariectomized rats treated with E2 or in pregnant rats, injection of CGRP significantly 
10 decreases kidney vascular resistance (39). The observed increased expression of 
RDC1 in kidney provides a mechanism for the E2 induction of sensitivity to CGRP in 
the kidney, resulting in the large increase in renal flow seen during pregnancy (40). 

Thus, one embodiment disclosed herein relates to a method of identifying 
candidate agents for treating conditions associated with reduced renal flow, such as 

15 caused by diuretics or congestive heart failure. Toxicity and therapeutic efficacy of 
such agents identified by the above methods can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, for example, for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the 
dose therapeutically effective in 50% of the population). The dose ratio between toxic 

20 and therapeutic effects is the therapeutic index and can be expressed by the ratio, 
LD50/ED50. compounds that exhibit large therapeutic indices are preferred. While 
compounds that exhibit toxic side effects can be used, care should be taken to design 
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a delivery system that targets such compounds to the site of affected tissue to 
minimize potential damage to normal cells and thereby reduce side effects. 

The data obtained from the cell culture assays and animal studies can be 
used to formulate a dosage range for use in humans. The dosage of such compounds 
5 likes preferably within a range of circulating concentrations that include ED50 with little 
or no toxicity. The dosage can vary within this range depending upon the dosage form 
employed and the route of administration. 

The pharmaceuticals of the present invention can be formulated by standard 
techniques using one or more physiologically acceptable carriers or excipients and the 
10 biologically active agent. The agents and its physiologically acceptable salts and 
solvates can be formulated and administered orally, introraly, rectally, parenteral^ 
epicutaneously, topically, transdermal^, subcutaneously, intramuscularly, intranasally, 
sublingualis intradurally, introcularly, intravenously , intraperioneally, or by inhalation. 

With regard to oral administration, the pharmaceutical compositions can take 
1 5 the form of tablets or capsules prepared by conventional means with pharmaceutically 
acceptable excipient, such as binding agents etc. Tablets may be coated according to 
methods well known in the art. Liquid preparations can be in the form of solutions, 
suspensions and syrups or can be initially in dry form for constitution with water or 
other suitable vehicle. Other additives may include suspending agents, such as 
20 sorbitol syrup, cellulose derivatives or hydrogenated edible fats, emulsifying agents or 
non-aqueous solutions. Preparations for oral administration may also be formulated 
for a time or controlled release of the active ingredient using techniques well know in 
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the art of the invention. 

Other formulations of the pharmaceuticals of the invention may be depot 
preparations for administration via implantation. 

The pharmaceutical compositions of the present invention may be presented 
5 in a pack or dispenser device that contains one or more unit dosage forms containing 
the active ingredient. The pack can for example comprise metal or plastic foil, for 
example a blister pack. The pack or dispenser would contain instructions for 
administration. 

Methods of Monitoring 

10 The identification of the plurality of genes described above provides a 

powerful tool for assessing the progression of a state, condition or treatment. 
Specifically, a plurality of genes can be identified in a patient prior to an event, such as 
menopause, surgery, the onset of a therapeutic regime, or the completion of a 
therapeutic regime, to provide a base line result. This base-line can then be compared 

1 5 with the result obtained using identical methods either during or after such event. This 
information can be used for both diagnostic and prognostic purposes. 

Kits 

Another embodiment is directed to a kit containing a plurality of genes, preferably on a 
substrate. The kit also may comprise one or more containers or packages, along with 
20 reagents, solutions and possibly instructions for use. 
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All of the cited references are herein incorporated by reference. The invention 
is further described by the following Examples, which do not limit the invention in any 
manner. 

5 Examples 

Example 1 : Introduction to Study and Animal-Related Procedures 

Introduction 

Estrogen receptors are expressed in numerous organs, although only a few 
organs are considered classical targets for estrogens. A systematic survey of estrogen 

10 regulation of approximately 10,000 genes in 13 tissues from wild type and ERpKO 
mice treated subcutaneously with vehicle or 1 7p-estradiol (E2) for six weeks was 
conducted. As expected, the uterus and pituitary had the greatest number of genes 
regulated by E2, while, surprisingly, the kidney had the third largest number of 
regulated genes. Some of these kidney regulations may provide mechanisms for 

15 known physiological effects of estrogens. For example, E2 induction of CYP27B1 , the 
rate limiting enzyme in the synthesis of 1 ,25-dihydroxyvitamin D, may explain the 
ability of estrogens to decrease urinary calcium excretion in women. In situ 
hybridizations localized E2 regulation in the kidney to the juxtamedullary proximal and 
distal collecting tubule epithelial cells in both the mouse and rat. E2 regulations in the 

20 kidney were intact in the ERpKO mice, and the ERa selective agonist propyl pyrazole 
triol acted similarly as E2, together suggesting an ERa mediated mechanism. Finally, 
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the combination of the AF1 -selective agonist tamoxifen plus mice expressing an 
AF1 -deleted version of ERa (previously designated as ERa knockouts) allowed clear 
identification of genes dependent upon ERa AF1 activity and genes dependent upon 
ERa AF2 activity. Both AF1 and AF2 dependent genes were stimulated by E2 with the 
5 same ED 5 a, indicating that sensitivity of gene regulation in the kidney depends upon 
ER ligand binding and not on the subsequent ER activation mechanisms. 

Animal-Related Procedure 

Animals-Wildtype 129 strain female mice or Sprague-Dawley rats (bred at 
Wyeth or obtained from Taconic Farms) were placed on a casein-based diet at 

10 approximately 6 weeks of age. One week later, the animals were ovariectomized. 
Commencing the day after ovariectomy, each animal received a daily subcutaneous 
treatment with vehicle (50% DMSO, 50% phosphate buffered saline) or. vehicle 
containing treatments for six weeks. Each group consisted of six or seven animals. 
Approximately 2 hours following the final treatment, the animals were euthanized with 

15 selected tissues frozen in liquid nitrogen for RNA analysis or on dry ice for histology. 

Example 2 : Preparation of Microarray 

GeneChip-Total RNA was prepared separately from each individual organ by 
using Trizol (Invitrogen) followed by subsequent repurification on Rneasy columns 
(Qiagen). In general, two pools of RNA were created using equal amounts of RNA 
20 from three mice. For small organs such as pituitary, an equal amount of RNA from six 
animals was combined. 
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Target Preparation and Array Hybridization-Total RNA was used to generate 
biotin labeled cRNA target as described (8) which was hybridized to the murine 
MGJJ74Av2 probe arrays (Affymetrix, Santa Clara, CA) for 16 h at 45°C. Eleven 
biotin-labeled cRNAs at defined concentration were spiked into each hybridization and 
5 used to convert average difference values to frequencies expressed as parts per 
million. 

Example 3 : Data Selection and Analysis 

Pairwise comparisons were made between each of the treatments. We 
calculated the fold change ratio, the p-value based on Student's West, the number of 

10 present calls, and the expression level for each comparison. A confidence score (CS) 
was defined as CS(x)= FC(x) + PV(x) + PC(x) + EL(x) where FC, PV, PC and EL are 
scores assigned to the fold change, p-value, number of present calls, and the : 
expression level, respectively. FC(x) was assigned 5 if the fold change ratio was 
greater then 1.95 and was assigned 0 if the ratio was between 1.95 and 1.5. PV(x) 

15 was assigned 3 if the p-value was less then 0.05 and was assigned 2 if the p-value was 
between 0.05 and 0.1. PC(x) was assigned 3 if at least 50% of the samples are called 
P by the Affymetrix algorithm and assigned 1 if only 25% of the samples are called P. 
EL(x) was assigned 3 if at least two samples had a frequency value of 20 or greater 
and assigned 1 if two samples only had a frequency greater then 15. Penalty points 

20 were assigned if the fold change was less then 1 .5, the p-value was greater then 0.2 or 
the frequency values were below 1 5 ppm. CS(x) ranged for -14 to 14 with qualifiers 
having a score of 14 considered the most significant changes. Genes with 11 or more 
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points in any one pairwise comparison were considered to be significant and were 
included for further analysis. Real-time PCR on individual RNA samples and histology 
analyses were performed essentially as described previously (9, 10). 

Example 4 : Discussion of Results 

5 To begin a systematic survey of estrogen receptor regulation of gene 

expression in the mouse, ovariectomized wild-type (WT) and ER(3KO mice were 
treated by daily subcutaneous administration of either vehicle or 20 (jg/kg/day 
17(3-estradiol (E2) for six weeks. RNA prepared from 13 organs was analyzed by 
microarray for estrogen regulation of gene expression. The resulting data set was 

10 queried for genes whose regulation was dependent on ERa or ERp. For ERa 
regulation, the basal expression level was predicted to be the same in WT and ERpKO 
mice, with E2 induction or suppression occurring in both WT and ERpKO mice (Figure 
1). For ERp regulation, basal expression was predicted to remain constant, with E2 
induction or suppression occurring in WT mice but not in ERpKO mice. ERa pattern 

15 regulations were found in well known estrogen target tissues such as the uterus (514 
inductions, 19 repressions), pituitary (56 inductions, 30 repressions) and bone marrow 
(3 inductions, 3 repressions). In contrast, essentially no genes could be discerned that 
fit the predicted ERp regulation pattern in any tissue. 

Surprisingly, the kidney had a very large number of genes regulated at least 
20 2-fold by E2 (26 inductions, 4 repressions; Figure 2). To further characterize E2 
regulation of gene expression in the kidney, in situ hybridization was used to localize 
E2 induction of CYP7B1 , TF, STAT5A and GADD45G In each case, induction of gene 
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expression occurred in the juxtamedullary region of the kidney (Figure 3) primarily in 
the proximal and distal tubule epithelium (not shown). Estrogen regulation of STAT5A 
and GADD45G also occurred in rat kidney juxtamedullary region (Figure 4), 
demonstrating that the estrogen responsiveness of kidney is not limited to the mouse. 

5 The ED 5 oS for E2 stimulation of CYP7B1, TF, STAT5A, and BCAT1 in the 

kidney were all very similar at about 3 pg/kg/day (Figure 5). Although this is 
approximately 10-fold greater than the ED 50 dose of E2 required for uterine weight 
increases, the ED 50 for gene induction in the uterus can vary by 20-fold, from 0.2 
ug/kg/day E2 for BCAT1 induction to 2.7 ug/kg/day for c-fos (Figure 5). The E2 
10 induction of gene expression in the kidney at the same dose as induction of well 
characterized genes such as c-fos in the uterus suggests that regulation of kidney 
gene expression occurs at physiological levels of E2. 

Confirmation of the role of ERa in the induction of kidney gene expression 
was obtained with 4-propyM ,3,5-Tris(4-hydroxy-pheny!) pyrazole (PPT) a compound 

15 which exclusively activates ERa but not ER|3 (11). Treatment with PPT induced 
expression of CYP7B1 , TF, STAT5A and BCAT1 to a similar extent as did treatment 
with E2 (Figure 6). Further, two ERp selective agonists (W-0292 and W-0070, both 
approximately 75-fold selective for ERp compared to ERa by in vitro binding assays; 
data not shown) failed to stimulate expression of any of these four genes (Figure 6). 

20 Finally, in agreement with previous results, ERa mRNA was detectable within the 
mouse kidney (Figure 7). The regulation of these genes in WT and ERpKO mice, the 
similar E2 ED 50 for each gene, the activity of a selective ERa agonist, the inactivity of 
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selective ERP agonists, and the expression of ERa within the kidney together suggest 
a single, ERa mediated pathway for regulation of these genes. 

It has been recognized that a commonly utilized strain of ERaKO mice (12) in 
fact expresses an ERa protein lacking only AF1, due to alternative splicings of the 
5 exon containing the targeted knockout mutation(13, 14). The resulting truncated ERa 
proteins, referred to here as AAF1-ERa, have the ability to stimulate expression of a 
synthetic estrogen response element driven promoter (14). As found previously for 
ERaKO mice, the level of this misspliced transcript in the uterus of ERaERpKO mice 
was lower than the level of full length message in WT mice (Figure 7). Again as 
1 0 expected, the amount of intact ERa mRNA was much lower in the whole kidney than in 
uterus from WT mice. However, the level of AAF1-ERa mRNA was actually greater in 
the ERaERpKO kidney than was intact ERa mRNA in WT kidney. No ERp mRNA 
could be detected in either uterus or kidney from the ERaERpKO mice. 

The presence of AAF1-ERa at significant levels in the kidney allows 
1 5 determination of the relative contribution of AF1 and AF2 to E2 regulation of individual 
genes. To determine whether AF1 or AF2 regions of ERa were required for induction 
of CYP7B1 , TF or BCAT1 in the kidney, WT, ERaERpKO or ERaKO mice were treated 
with E2 or the AF1 selective agonist tamoxifen (15). Expression of CYP7B1 was 
induced by E2 but not by tamoxifen in WT mice (Figure 8), suggesting that induction of 
20 CYP7B1 occurred through AF2. Consistent with this hypothesis, E2 also increased 
CYP7B1 expression in mice expressing AAF1-ERa (either ERaERpKO or ERaKO 
mice). This E2 induction was blocked by an excess of ICI-1 82780, confirming the 
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regulation occurred through ER. Together, these two lines of evidence suggest that 
induction of CYP7B1 is an AF2 dependent process. In contrast, TF expression was 
induced by both E2 and tamoxifen in WT mice. Neither compound could induce TF 
expression in ERaERpKO mice (which express only AAF1-ERa). This suggests that 
5 induction of TF occurs through an AF1 mediated pathway. Finally, BCAT1 was also 
induced by both E2 and tamoxifen in WT mice. However, in ERaERpKO mice, E2 
stimulated BCAT1 expression but tamoxifen did not. These results suggest that 
BCAT1 expression can be stimulated through either AF1 or AF2 mechanisms. In WT 
mice, tamoxifen stimulates expression through AF1 only. Since AAF1-ERa lacks the 
10 AF1 region necessary for tamoxifen activity, tamoxifen cannot stimulate BCAT1 
expression in ERaERpKO mice. In contrast, E2, which can stimulate BCAT1 
expression through either AF1 or AF2, can still stimulate expression in the AAF1-ERa 
expressing mice. 

Estrogen receptors a or p are found in almost all organs of the body, yet 
1 5 relatively few tissues are considered targets for estrogen action. To begin to develop a 
more complete understanding of estrogen biology, estrogen responsive genes in 13 
tissues from WT and ERpKO mice were characterized. In general, many tissues 
showed patterns of E2 regulation consistent with an ERa mechanism, including such 
known target organs as uterus, pituitary, and bone. Surprisingly, no E2 regulations 
20 were found that fit the expected pattern for ERp regulations. This was true even in 
organs expressing moderately high levels of ERp such as the bladder and lung (7). At 
least three mechanisms could explain the lack of detection of expected ERp 
responses. First, it has been proposed that a major function of ERp is to modulate the 
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activity of ERp(16). For example, expression of the Ki-67 protein was constitutively 
elevated in uterus of ER0KO mice, i.e. in the ERBKO mice its expression was always 
equivalent to the E2 stimulated levels in WT animals (17). The survey criteria used 
here would not detect this pattern. Further analysis of these data has revealed many 
5 genes in multiple tissues which also have this "nonclassical" pattern of regulation 
whereby expression is constitutively elevated in both vehicle and E2 treated ER0KO 
mice (data not shown). Second, analysis of whole organs may easily miss regulations 
occurring in only selected cell subtypes within an organ. For example, initial analysis 
of kidney did not identify GADD45G as being regulated by E2, because GADD45G 
10 expression is regulated only in tubule epithelial cells. The unregulated expression of 
GADD45G throughout most of the kidney sufficiently diminished the fold induction so 
as to be less than 2-fold in whole organ samples. The combination of laser capture 
microdissection with microarray technology (18) should allow detection of ER|3 
regulated genes with a classical pattern of regulation. 

15 

This global survey demonstrates that, unexpectedly, the kidney had a very 
large number of regulated genes. Both genetic approaches (Figure 2) and 
pharmacological approaches (Figure 6) demonstrated that E2 regulation in the kidney 
was mediated through ERa. Expression of CYP7B1, TF, STAT5A, and even genes 
20 such as GADD45G which are expressed throughout the kidney showed regulation 
only in tubule epithelium (Figure 3). Additionally, KIM-1 , the rat counterpart of mouse 
TIM1 and TIM2 (20) is also expressed in proximal tubule epithelial cells (21). Finally, 
3 H-E2 binding localizes to proximal tubule cells following administration to rats(22). 
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Together, these results suggest that ERa directly regulates gene expression in tubule 
epithelial cells. 

Although the observed regulations in the kidney were mediated by ERa, the 
mechanism of activation of gene expression by ERa was gene specific. Thus studies 
5 using tamoxifen, which activates ERa through AF1, along with studies using 
AAF1-ERaKO mice (previously designated as ERKO mice) together indicate that E2 
induction of CYP7B1 expression occurred predominantly through an AF1 -dependent 
mechanism, E2 induction of TF expression occurred predominantly through an 
AF2-dependent mechanism, and E2 induction of BCAT1 expression occurred through 

1 0 both AF1 and AF2 mechanisms (Figure 8). The ED 5 o values for E2 stimulation of these 
three genes were all very similar (Figure 5). Thus, whether a gene is induced through 
either AF1 or AF2 mechanism does not influence the sensitivity of the gene in the 
kidney to plasma estrogen levels. Rather, the data indicate that the binding of E2 to 
ERa would be the rate limiting step in induction of gene expression in the kidney. The 

1 5 maximal fold regulation varied between genes and may depend upon whether and 
AF1 or AF2 dependent pathway is utilized. 

Analysis of 10 kb of upstream putative promoter sequences of E2 induced 
genes identified good matches to the consensus estrogen response element (ERE) in 
only a few genes, although ERE half-sites could be identified in most promoters. Many 
20 of these genes may be activated through nonclassical ERa mechanisms such as the 
combination of an ERE half-site with Sp1 binding sites (41). It is unlikely that a 
nonclassical ERa / AP1 stimulatory mechanism is responsible for these regulations, 
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since IC1 182780 functions as a partial agonist in this mechanism (42) while IC1 182780 
was a complete antagonist for E2 regulation of gene expression in the kidney (Figure 
8). Additionally, E2 induced expression of the transcription factors PHD2, ELF3, 
STAT5A and STAT5B. It is possible that E2 induction of these transcription factors 
5 resulted in the subsequent increase in expression of the remaining genes. For 
example, CIS2 is a known target for induction by STAT transcription factors (43), 
suggesting that the E2 induction of CIS2 is mediated indirectly through the E2 
induction of STAT5A and STAT5B. 
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MeanWTE2 
Fold Change 


2.32 


2.31 


2.31 


CO 

cm' 


co 

CNI 


co 
cm' 


2.29 


2.29 


2.29 


2.28 


2.28 


2.28 


2.28 


2.27 


2.27 


2.26 


2.24 


2.24 


1 2.23 


2.23 


2.21 


I 2.21 


2.21 


CM 
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Tissue 


Uterus | 


Uterus 


Uterus 


Uterus 


Uterus | 


Uterus 


Uterus 


Uterus 


Uterus 


Uterus 


Uterus 


Uterus 


Uterus 


Uterus 


| Uterus 


Uterus 


| Uterus 


I Uterus 


lUterus 


Uterus 


| Uterus 


Uterus 


I Uterus 


Gene Name 


sex comb on midleg-like 1 (Drosophila) I 


ESTs, Moderately similar to unnamed protein product 
[H.sapiens] 


DNA segment, Chr 2, University of California at Los Angeles 1 


ESTs, Moderately similar to G3P MOUSE 
GLYCERALDEHYDE 3-PHOSPHATE DEHYDROGENASE 
[M.musculus] 


vimentin ! 


lymphocyte antigen 57 


nuclear receptor co-repressor 2 


neural precursor cell expressed, developmentally down- 
regulated gene 4 


cofilin 1, non-muscle 


cytidine 5'-triphosphate synthase 


ATPase, H+ transporting, lysosomal (vacuolar proton pump), 
42kDa 


ESTs, Highly similar to COP-COATED VESICLE MEMBRANE 
PROTEIN P24 PRECURSOR [Cricetulus griseus] 


signal transducer and activator of transcription 3 


1 

ESTs, Weakly similar to AF077034 1 HSPC010 [H.sapiens] 


i capping protein beta 1 


dynactin 3 


I histocompatibility 2, T region locus 23 


S-adenosylhomocysteine hydrolase 


coagulation factor Hi 


ESTs, Weakly similar to myelin transcription factor 1-like 
[M.musculus] 


interferon regulatory factor 7 


RIKEN cDNA 1810009M01 gene 


I ubiquitin-activating enzyme E1, Chr X 


Pub Name 


SCML1 


UNK AW230209 


D2UCLA1 


UNK AI835771 


VIM ! 


LY57 


NCOR2 


NEDD4 


CFL1 


CTPS 


ATP6D 


UNK AW1 20643 


STAT3 


UNK AI847697 


CAPPB1 


DCTN3 


H2-T23 


AHCY 


co 
u. 


UNK AA764261 


IRF7 


1810009M01RIK 


UBE1X 


NUP50 


Mousedata. 
Qualifier 


103399 at 


94327 at 


96345 at 


97751 f at 


101047 at 


100772 p a 


95128 at 


93101 s at 


99119 at 


103341 at 


92603 at 


96708 at 


99100 at 


95105 at 


95142 s at 


94522 at 


98472 at I 


96025 g at 


97689 at 


104533 at 


104669 at 


97885 at 


92616 at 


93046 at 
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